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DETAILED ACTION 

1 . The text of those sections of Title 35, U.S. Code not included in this action can be found 
in a prior Office action. 

Response to Arguments 

2. Applicant's arguments filed on 01/08/2009 with respect to the claim rejection under 35 
USC 103, have been fully considered but they are not persuasive. 

In response to applicant's arguments (regarding claims 14, 1 and 27) that "D'hoore does 
not use the voice print to translate voice data to text" and "D'hoore does teach or suggest the 
voice print to translate voice data to text as required by the claims" (Remarks: page 2, paragraph 
2 to page4, paragraph 1), the examiner respectfully disagrees with applicant's arguments and has 
a different view of prior art teachings and claim interpretations. 

It is noted that the applicant appears to repeat the same or similar arguments that has been 
responded in previous office action (see the section of Response to Arguments and 
corresponding claim rejection filed on 08/08/2008). Even though the applicant recited the whole 
paragraph of the first aspect of the examiner's arguments (filed on 08/08/2008), the applicant's 
allegation that "examiner is reading subject matter into D'hoor that is simply not taught or 
suggest by the reference" lacks sufficient evidence(s) and/or analysis for support the allegation, 
which failed to fully respond to the previous examiner's arguments for supporting the rejection. 
For example, the applicant totally ignored the examiner's analysis and the corresponding prior 
art teachings that 'speech recognition system is restricted to mapping (translating) the speech 
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onto language specific symbols (text)', 'these voice prints can be used to recognize utterances of 
the trained word by the speaker', and ' . . .it (voice print) can also be used to check or detect the 
identity of the speaker (clearly implying using voice print for speech recognition (i.e. speech-to- 
text) and/or speaker identification' (D'hoore: col. 7, lines 32-55). Particularly, the applicant 
intentionally silenced about the recognized words/symbols being text as result/output of a speech 
recognition, which is common knowledge in the art. Therefore, the new applicant's 
arguments/allegation cannot overcome the previous claim rejection and the examiner's 
arguments, which is recited as following: 

"Firstly, the applicant's arguments (Remarks: page 4, paragraphs 2-3) appear to suggest that the speech 
recognition does not convert (translate) speech (voice data) into text (symbols) only because D'hoore does not 
expressly use the words "translate" and "text". However, this is not persuasive because one of ordinary skill in the 
art would have recognized that these words could be replaced by other equivalent words, such as 
convert/recognize/map/match and symbol/words/written-string, and converting speech into text (speech-to-text) 
would be an implicit/necessary functionality of speech recognition, in nature. It is noted that, D'hoore indeed uses 
the equivalent (or alternative) words to teach that 'speech recognition system is restricted to mapping (translating) 
the speech onto language specific symbols (text)' (D'hoore: col. 7, lines 36-38). D'hoore also teaches that 'the 
system will automatically construct the best possible phoneme or model unit sequence to describe the word (text), 
based on the phoneme model database and the uttered speech', 'this sequence is referred to as a voice print' that 'can 
be used to recognize utterances of the trained word by the speaker', and 'it can also be used to check or detect the 
identity of the speaker' (D'hoore: col. 7, lines 32-55). One of ordinary skill in the art would have readily recognized 
that D'hoore's disclosure teaches the two different features: speech recognition that maps speech to text (symbols) 
and speaker identification (or verification) that identifies a targeted speaker, both using voice print." 



Finally, it is necessarily pointed out that the prior art (such as D'hoore) teachings should 
be treated as a whole, but the applicant failed to do so. It can be seen that the applicant's 
arguments only emphasized D'hoore's speech recognition to recognize pronunciations words or 
produce phoneme models (corresponding to spoken words), but completely refused/ignored to 
face/analyze the recognized words/symbols (corresponding to textural words), which is a basic 
goal/result/output of a speech recognition system in the art. 
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For above reasons, the applicant's arguments are not persuasive and the rejection is 
sustained. 

Claim Rejections - 35 USC § 103 

3. Claims 1-3, 5-16, 18-29, 31-40 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over HEDIN et al. (US 6,185,535 Bl) hereinafter referenced as HEDIN in view of KING (US 
6,532,446 Bl) and D'HOORE et al. (US 6,085,160) hereinafter referenced as D'HOORE. 

Regarding claim 14, HEDIN discloses 'voice control of a user interface to service 
applications' (title), comprising: 

"a device for receiving and transmitting data" (Fig. la and col. 4, lines 25, 'the client part 
101 (corresponding to device) receives speech (data) from user', 'those words (data) . . .sent 
(transmit) to the server part 103'); 

"a computer having a data store coupled thereto, wherein the data store stores data, 
connected to the device" (Fig. 1 and col. 4, lines 46-50, 'the server (a computer) part 103 is 
implemented in a separated processor (computer)' that 'is more powerful (e.g., faster, more 
storage space (data store), etc.)', 'the first digital link 105 for coupling (connecting) the client 
and server part 101, 103 may be wireless or wireline'); and 

"one or more computer programs, performed by the computer" (col. 6, lines 31-35, 
'various embodiments may utilize one or more programmable elements (computer programs) 
executing a stored program to perform a number of functions') for: 

"receiving voice data and a device identifier from the device" (col. 5, lines 20-22, 

'the server. . .uses its own, more powerful ASR to analyze the received speech (voice 
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data)'; col. 4, lines 62-63, 'through WAP URL (interpreted as device identifier', wherein 
WAP also inherently includes device identifier(s) for both sides of communication; also 
see col. 1, lines 21-34); 

"translating the voice data to text", (col. 5, lines 20-22, 'the server. . .uses its own, 
more powerful ASR (speech-to-text) to analyze the received speech (voice data); col. 6, 
lines 6-7, 'the spoken text will either be recognized and converted (translated) to text by 
the ASR in the client 101, or alternatively by the ASR in the gateway/proxy part 107 
(replacing server part 103)'); 

"determining whether to filter the translated text" and "if it is determined that the 
translated text is to be filtered, applying a filter to the translated text", (col. 5, lines 43-55 
'when the data formats are different' determining 'to convert (filter) the data from one 
format to the other' (wherein converting formats is interpreted as filtering, in light of 
specification: see page 6, lines 16-20), 'conversion . . .not only substituting (formatting) 
keywords from one format to another (e.g. from HTML (text) to WML), but also some 
level of filtering to weed out data that cannot be received by the terminal. . . '). 
HEDIN does not expressly disclose "the translated text is returned to the device." 
However, the feature is well known in the art as evidenced by KING who discloses 'server based 
speech recognition user interface for wireless devices' (title), and teaches that 'the symbolic data 
file (corresponding to translated text) is then sent back to the originating mobile device' (col. 3, 
lines 16-19) and 'the processed symbolic data file. . .may be reformatted (filtered) . . .then sent to 
the requesting mobile device or to a designated third party device' (col. 10, lines 32-48). 
Therefore, it would have been obvious to one of ordinary skill in the art at the time the invention 
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was made to modify HEDIN by providing sending the recognized/converted/reformatted 
symbolic data (text) back to the originating device (or a designated third party device), as taught 
by KING, for the purpose (motivation) of improving the usability and commercial viability of 
network for using speech recognition services (KING: col. 2, lines 37-43). 

Further, it is noted that HEDIN in view of KING does not expressly disclose "the voice 
data is translated to text using a voice print" that "is retrieved from datastore based on the 
device identifier". However, the feature using voice print for speech recognition is well known 
in the art as evidenced by D'HOORE who discloses 'language independent speech recognition' 
(title), and teaches that 'the words in the vocabulary of recognizable word may be described by a 
voice print. . . ' (col. 2, lines 1-5), 'speaker dependent training of words, try to find the best 
possible phonetic representation for a particular word based on a few utterances of that word(s) 
by the user', 'speech recognition system. . .mapping (translating) the speech onto language 
specific symbols (text)' and 'these voice prints can be used to recognize utterances of the trained 
word by the speaker' (col. 7, lines 32-55). Further, as stated above, since HEDIN discloses that 
'in a multi-user environment, each user's profile must be stored (datastore)' (col. 1, lines 66-67; 
also col. 8, lines 56-58) and using WAP URL (device identifier) (col. 4, lines 62-63), and 
D'HOORE discloses speech recognition and speaker identification using voice prints (col. 7, 
lines 45-55), one of ordinary skill in the art would have recognized that voice print would be 
readily and properly stored in the corresponding user profile using WAP URL (as device 
identifier) for future retrieving and the storing/retrieving result would be predicable for the 
ordinary skilled person, because voice print is user specific information and user profile is most 
suitable place to keep user specific information. Therefore, it would have been obvious to one of 
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ordinary skill in the art at the time the invention was made to combine teachings of HEDIN in 
view of KING and D'HOORE by providing speech recognition (voice translation) using voice 
prints and storing the voice prints in the corresponding user profile using WAP URL (as device 
identifier) for retrieving, for the purpose (motivation) of trying to find the best possible phonetic 
representation based on user-uttered words (text) and/or better matching the speech of the 
targeted speaker for speech recognition (D'HOORE: col. 7, lines 34-53). 

Regarding claim 15 (depending on claim 14), HEDIN in view of KING and D'HOORE 
further discloses "storing a user profile in a data store connected to the computer", (HEDIN: col. 
1, lines 66-67, 'in a multi-user environment, each user's profile must be stored'; col. 8, lines 56- 
58, 'the RAP server 205' 'may be implemented as a multi-user, central WAP application 
server'). 

Regarding claim 16 (depending on claim 15), HEDIN in view of KING and D'HOORE 
further discloses "user profile comprises a voice print" (HEDIN: col. 1, lines 66-67, 'each user's 
profile must be stored'; D'HOORE: col. 7, lines 45-55, 'voice prints. . . can also be used to check 
or detect the identity of the speaker'; so that it would have been obvious to one of ordinary skill 
in the art at the time the invention was made to modify HEDIN in view of KING and D'HOORE 
by providing a voice print in a user profile, because voice print is user specific information and 
user profile is most suitable place to keep user specific information). 

Regarding claim 18 (depending on claim 14), HEDIN in view of KING and D'HOORE 
further discloses "determining comprises extracting one or more key words from the translated 
text", (HEDIN: col. 5, lines 45-55 'conversion . . .not only substituting keywords from one format 
to another (e.g. from HTML (text) to WML), but also some level of filtering to weed out data 
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that cannot be received by the terminal'; HEDIN: col. 5, lines 24-28, 'the recognized speech (the 
translated text) may consist of commands (keywords) for controlling the server application, in 
which case the command are acted upon' (implying the command is extracted); HEDIN: col. 9, 
lines 59-67, 'if the ASR 307 looks for. . .the phrase "*CALL*", then the ASR 107 will detect 
(corresponding to extract) that the unrecognized isolated word consists of the word "CALL" with 
another unrecognized part following it'). 

Regarding claim 19 (depending on claim 18), HEDIN in view of KING and D'HOORE 
further discloses "a filter is selected based on one or more extracted key words" (HEDIN: col. 5, 
lines 43-55, 'conversion . . .not only substituting (formatting) keywords from one format to 
another (e.g. from HTML (text) to WML), but also some level of filtering to weed out data that 
cannot be received by the terminal. . . '; col. 15, lines 59-66, the devices used for the applications 
can be 'a WAP-enabled phone', 'electronic notepads', or 'windows-based' 'computer'; col. 5, 
lines 24-26, 'the recognized speech (text) may consist of commands (extracted keywords) for 
controlling server application'; col. 9, lines 55-67, 'TP (terminal part) command words 
(extracted keywords, such as "CALL")'; col. 5, lines 59-66, different 'services' and/or 
'applications' that request displaying menus; it would have been obvious to one of ordinary skill 
in the art at the time the invention was made to recognize that the recognized commands 
(keywords) could be associated with different devices or applications that have different textual 
display formats, such as menus, so as to combine the different teachings of HEDIN together by 
providing different text formats for different devices and/or applications based on the recognized 
commands (keywords), for the purpose (motivation) of providing enhanced and extended 
services/applications in advanced mobile networks for the user (HEDIN: col.4, lines 63-65)). 
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Regarding claim 20 (depending on claim 14), HEDIN in view of KING and D'HOORE 
further discloses "applying the filter comprises formatting the translated text", (HEDIN: col. 5, 
lines 45-55 'when the data formats are different. . . convert (filter) the data (translated text) from 
one format to the other', 'conversion . . .not only substituting (formatting) keywords from one 
format to another (e.g. from HTML (text) to WML), but also some level of filtering to weed out 
data that cannot be received by the terminal. . . '). 

Regarding claim 21(depending on claim 20), HEDIN in view of KING and D'HOORE 
discloses "formatting the translated text for an application", (HEDIN: col. 5, lines 50-55, 'if the 
server 109 is an application that is accessible via the Internet. . .pass on to the client 101 only that 
data (text data) that is appropriate'; col. 14, 10-21, 'weather information service' (application); 
col. 15, lines 55-67; interactive voice controlled services (applications)'). 

Regarding claim 22 (depending on claim 20), HEDIN in view of KING and D'HOORE 
further discloses "formatting the translated text for the device", (HEDIN: col. 15, lines 62-63, 
'voice-enabled special devices, such as electronic notepads'). 

Regarding claims 23-24 and 26 (depending on claim 14), the rejection is based on the 
same reason described for claim 14, because it also reads on the limitations of claims 23-24 and 
26 respectively. 

Regarding claim 25 (depending on claim 24), HEDIN in view of KING and D'HOORE 
further discloses "returning the filtered text via an electronic mail message", (KING: col. 2, lines 
63-64, 'email'). 

Regarding claims 1-3 and 5-13, they recite a method. The rejection is based on the same 
reason described for claims 14-16 and 18-26 respectively, because the method claims and 
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apparatus claims are related as apparatus and method of using same, with each claimed element's 
function corresponding to the claimed method step. 

Regarding claims 27-29 and 31-39, they recite an article of manufacture. The rejection 
is based on the same reason described for claims 14-16 and 18-26 respectively, because the 
article claims and apparatus claims are related as apparatus and article of using same, with each 
claimed element's function corresponding to the claimed article element's function. 

Regarding claim 40 (depending on claim 1), HEDIN in view of KING and D'HOORE 
further discloses "the device identifier comprises a unit identifier which identifies a particular 
device operated by a user" (KING: col. 6, line 41-42, 'the contact information . . .(e.g. a phone 
number or a uniform resource indicator (URI), which is read on unit identifier) may be 
embedded in software loaded on the mobile device'; col. 9, lines 53-58, 'each of the mobile 
devices serviced by link server device is assigned an identification (ID) or device (ID)' (read on 
unit identifier) and 'a device ID can be a phone number of the device or an IP address or a 
combination of an IP address and a port number', it would have been obvious to one of ordinary 
skill in the art at the time the invention was made to combine teachings of HEDIN, KING and 
D'HOORE by providing an ID for each device, such as using a phone number, IP address, or 
URI for each of mobile devices, taught by KING, for the purpose (motivation) of identifying 
mobile device to outside entities and corresponding the device with associated user account 
(KING: col. 8, lines 38-43)). 
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Conclusion 

4. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 CFR 
1 .136(a) will be calculated from the mailing date of the advisory action. In no event, however, 
will the statutory period for reply expire later than SIX MONTHS from the mailing date of this 



5. Please address mail to be delivered by the United States Postal Service (USPS) as 
follows: 

Mail Stop 

Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 
or faxed to: 571-273-8300, (for formal communications intended for entry) 
Or: 571-273-8300, (for informal or draft communications, and please label 
"PROPOSED" or "DRAFT") 

If no Mail Stop is indicated below, the line beginning Mail Stop should be omitted from 



Effective January 14, 2005, except correspondence for Maintenance Fee payments, 
Deposit Account Replenishments (see 1.25(c)(4)), and Licensing and Review (see 37 CFR 5.1(c) 
and 5.2(c)), please address correspondence to be delivered by other delivery services (Federal 
Express (Fed Ex), UPS, DHL, Laser, Action, Purolater, etc.) as follows: 

U.S. Patent and Trademark Office 

Customer Window, Mail Stop 

Randolph Building 

Alexandria, VA 223 14 
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Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to QI HAN whose telephone number is (571)272-7604. The 
examiner can normally be reached on M-TH:9:00- 19:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571)-272-7602. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



QH/qh 

August 10, 2009 
/QI HAN/ 

Primary Examiner, Art Unit 2626 



